SEQUENCE LISTING

GENERAL INFORMATION 

(i) APPLICANTS: NANCY CARRASCO, ET AL. 

(ii) TITLE OF INVENTION: THYROID SODIUM/IODIDE SYMPORTER AND NUCLEIC ACID ENCODING SAME

(iii) NUMBER OF SEQUENCES: 2

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: AMSTER, ROTHSTEIN & EBENSTEIN 

(B) STREET: 90 PARK AVENUE 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10016 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 INCH 1.44 Mb STORAGE DISKETTE

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: ASCII 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/595,553 

(B) FILING DATE: FEBRUARY 1, 1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: CRAIG J. ARNOLD 

(B) REGISTRATION NUMBER: 34,287 

(C) REFERENCE/DOCKET NUMBER: 96700/393 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 697-5995 

(B) TELEFAX: (212) 286-0854 or 286-0082 

(C) TELEX: TWX 710-581-4766 


(2) INFORMATION FOR SEQ ID NO: 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2839

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE

(D) TOPOLOGY: LINEAR 


(ii) MOLECULE TYPE: 

(A) DESCRIPTION: OLIGONUCLEOTIDE 


(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: RAT 

(B) INDIVIDUAL ISOLATE: SODIUM/IODIDE SYMPORTER

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

GAATTCCGGG TCGACCACGC GTCCGGCGGT GACTCGCGCT GCGACTCTCC 50 

CACTGACCGA GAGTCCCCGA CGTCCTCCGC ATCCTCTCCT CACCGAGTCA 100 

CCTGTCTCC ATG GAG GGT GCG GAG GCC GGG GCC CGG GCC ACC 142 
Met Glu Gly Ala Glu Ala Gly Ala Arg Ala Thr 
1 5 10

TTC GGC GCC TGG GAC TAC GGC GTG TTC GCG ACC ATG CTG CTG 184 

Phe Gly Ala Trp Asp Tyr Gly Val Phe Ala Thr Met Leu Leu 
15 20 25 

GTG TCC ACG GGC ATC GGG CTA TGG GTC GGC CTG GCC CGC GGT 226 

Val Ser Thr Gly Ile Gly Leu Trp Val Gly Leu Ala Arg Gly

30 35 

GGC CAA CGC AGT GCC GAC GAC TTC TTT ACC GGG GGC CGG CAG 268 
Gly Gln Arg Ser Ala Asp Asp Phe Phe Thr Gly Gly Arg Gln
40 45 50 

TTG GCA GCC GTT CCT GTG GGG CTG TCG CTG GCC GCC AGT TTC 310 
Leu Ala Ala Val Pro Val Gly Leu Ser Leu Ala Ala Ser Phe 
55 60 65 

ATG TCG GCT GTG CAG GTG CTC GGG GTC CCC GCC GAG GCA GCG 352 
Met Ser Ala Val Gln Val Leu Gly Val Pro Ala Glu Ala Ala
70 75 80 

CGC TAC GGG CTC AAG TTC CTG TGG ATG TGC GCG GGT CAG TTG 394 
Arg Tyr Gly Leu Lys Phe Leu Trp Met Cys Ala Gly Gln Leu
85 90 95 

CTC AAC TCG CTG CTC ACA GCG TTT CTC TTC TTG CCG ATC TTC 436 
Leu Asn Ser Leu Leu Thr Ala Phe Leu Phe Leu Pro Ile Phe

100 105 

TAC CGC CTG GGC CTT ACC AGC ACC TAC CAG TAC CTA GAG CTG 478 
Tyr Arg Leu Gly Leu Thr Ser Thr Tyr Gln Tyr Leu Glu Leu
110 115 120


CGC TTC AGC CGA GCG GTC CGG CTC TGC GGG ACG CTG CAG TAC 520
Arg Phe Ser Arg Ala Val Arg Leu Cys Gly Thr Leu Gln Tyr
125 130 135

TTG GTG GCC ACG ATG CTG TAT ACA GGC ATC GTG ATC TAC GCG 562 
Leu Val Ala Thr Met Leu Tyr Thr Gly Ile Val Ile Tyr Ala
140 145 150 

CCT GCG CTC ATC CTG AAC CAA GTG ACC GGG TTG GAC ATC TGG 604 
Pro Ala Leu Ile Leu Asn Gln Val Thr Gly Leu Asp Ile Trp
155 160 165 

GCA TCG CTC CTG TCC ACA GGA ATC ATC TGC ACC TTG TAC ACT 646 
Ala Ser Leu Leu Ser Thr Gly Ile Ile Cys Thr Leu Tyr Thr

170 175 

ACC GTG GGT GGT ATG AAG GCC GTG GTC TGG ACA GAT GTG TTC 688 

Thr Val Gly Gly Met Lys Ala Val Val Trp Thr Asp Val Phe 

180 185 190

CAG GTT GTG GTA ATG CTC GTT GGC TTC TGG GTG ATC CTG GCC 730 

Gln Val Val Val Met Leu Val Gly Phe Trp Val Ile Leu Ala

195 200 205 

CGA GGC GTC ATT CTC CTG GGG GGT CCC CGG AAC GTG CTC AGC 772 
Arg Gly Val Ile Leu Leu Gly Gly Pro Arg Asn Val Leu Ser
210 215 220 

CTC GCT CAG AAC CAT TCC CGG ATC AAC CTG ATG GAC TTT GAC 814 
Leu Ala Gln Asn His Ser Arg Ile Asn Leu Met Asp Phe Asp
225 230 235 

CCT GAT CCT CGG AGC CGG TAC ACC TTC TGG ACT TTC ATA GTG 856 
Pro Asp Pro Arg Ser Arg Tyr Thr Phe Trp Thr Phe Ile Val

240 245 

GGT GGC ACA CTG GTG TGG CTC TCC ATG TAC GGT GTG AAC CAA 898 
Gly Gly Thr Leu Val Trp Leu Ser Met Tyr Gly Val Asn Gln
250 255 260

GCC CAG GTA CAG CGC TAT GTG GCC TGC CAC ACA GAG GGA AAG 940
Ala Gln Val Gln Arg Tyr Val Ala Cys His Thr Glu Gly Lys
265 270 275 

GCC AAA CTG GCC CTG CTT GTC AAC CAG CTG GGC CTC TTC CTG 982 
Ala Lys Leu Ala Leu Leu Val Asn Gln Leu Gly Leu Phe Leu
280 285 290 

ATT GTG GCC AGT GCA GCT TGC TGT GGC ATT GTC ATG TTC GTC 1024 
Ile Val Ala Ser Ala Ala Cys Cys Gly Ile Val Met Phe Val
295 300 305 

TAC TAC AAG GAC TGT GAC CCC CTC CTC ACA GGC CGT ATC TCA 1066
Tyr Tyr Lys Asp Cys Asp Pro Leu Leu Thr Gly Arg Ile Ser

310 315 

GCC CCC GAC CAG TAC ATG CCG CTG CTT GTG TTG GAC ATT TTT 1108 

Ala Pro Asp Gln Tyr Met Pro Leu Leu Val Leu Asp Ile Phe
320 325 330 

GAG GAT CTG CCC GGG GTC CCC GGG CTC TTC CTG GCC TGT GCC 1150 

Glu Asp Leu Pro Gly Val Pro Gly Leu Phe Leu Ala Cys Ala 

335 340 345 

TAC AGT GGC ACC CTC AGC ACT GCA TCC ACC AGC ATC AAC GCC 1192 
Tyr Ser Gly Thr Leu Ser Thr Ala Ser Thr Ser Ile Asn Ala
350 355 360 

ATG GCA GCT GTG ACT GTG GAA GAC CTC ATC AAG CCG AGG ATG 1234 
Met Ala Ala Val Thr Val Glu Asp Leu Ile Lys Pro Arg Met
365 370 375 

CCT GGC CTG GCA CCT CGG AAG TTG GTT TTC ATC TCT AAA GGG 1276 
Pro Gly Leu Ala Pro Arg Lys Leu Val Phe Ile Ser Lys Gly
380 385

CTC TCA TTC ATC TAC GGC TCT GCC TGC CTC ACT GTG GCT GCT 1318 
Leu Ser Phe Ile Tyr Gly Ser Ala Cys Leu Thr Val Ala Ala
390 395 400 

CTG TCC TCA CTG CTG GGA GGT GGT GTC CTC CAG GGT TCC TTC 1360 
Leu Ser Ser Leu Leu Gly Gly Gly Val Leu Gln Gly Ser Phe
405 410 415 

ACT GTG ATG GGT GTC ATC AGT GGG CCT CTA CTA GGC GCC TTC 1402 
Thr Val Met Gly Val Ile Ser Gly Pro Leu Leu Gly Ala Phe
420 425 430

ACG CTT GGG ATG CTG CTC CCA GCC TGC AAC ACG CCA GGC GTT 1444 
Thr Leu Gly Met Leu Leu Pro Ala Cys Asn Thr Pro Gly Val 
435 440 445 

CTC TCC GGG TTG GCA GCA GGC TTG GCT GTA TCC CTG TGG GTG 1486 
Leu Ser Gly Leu Ala Ala Gly Leu Ala Val Ser Leu Trp Val 

450 455 

GCC GTA GGG GCC ACA CTG TAT CCC CCT GGA GAG CAG ACC ATG 1528 
Ala Val Gly Ala Thr Leu Tyr Pro Pro Gly Glu Gln Thr Met
460 465 470

GGG GTG CTG CCC ACC TCG GCT GCA GGC TGC ACC AAC GAT TCG 1570 
Gly Val Leu Pro Thr Ser Ala Ala Gly Cys Thr Asn Asp Ser 
475 480 485 

GTC CTC CTG GGC CCA CCT GGA GCC ACC AAC GCT TCC AAC GGG 1612
Val Leu Leu Gly Pro Pro Gly Ala Thr Asn Ala Ser Asn Gly 
490 495 500 

ATC CCC AGT TCT GGA ATG GAC ACG GGC CGC CCT GCC CTC GCT 1654 
Ile Pro Ser Ser Gly Met Asp Thr Gly Arg Pro Ala Leu Ala
505 510 515

GAT ACC TTT TAC GCC ATC TCC TAT CTC TAT TAC GGG GCT CTG 1696 
Asp Thr Phe Tyr Ala Ile Ser Tyr Leu Tyr Tyr Gly Ala Leu

520 525 

GGC ACG CTG ACC ACC ATG CTT TGC GGT GCT CTC ATC AGC TAC 1738 

Gly Thr Leu Thr Thr Met Leu Cys Gly Ala Leu Ile Ser Tyr
530 535 540

CTT ACT GGT CCC ACC AAG CGC AGC TCC CTG GGT CCC GGA TTG 1780 

Leu Thr Gly Pro Thr Lys Arg Ser Ser Leu Gly Pro Gly Leu 
545 550 555

CTG TGG TGG GAC CTT GCT CGA CAG ACA GCG TCT GTG GCC CCA 1822 
Leu Trp Trp Asp Leu Ala Arg Gln Thr Ala Ser Val Ala Pro
560 565 570 

AAG GAA GAC ACT GCC ACC CTG GAG GAG AGC CTG GTG AAG GGA 1864 
Lys Glu Asp Thr Ala Thr Leu Glu Glu Ser Leu Val Lys Gly 
575 580 585 

CCG GAA GAC ATC CCT GCT GTG ACC AAG AAG CCC CCT GGC CTC 1906 
Pro Glu Asp Ile Pro Ala Val Thr Lys Lys Pro Pro Gly Leu

590 595 

AAG CCA GGC GCC GAG ACC CAC CCC CTG TAT CTG GGG CAC GAT 1948 
Lys Pro Gly Ala Glu Thr His Pro Leu Tyr Leu Gly His Asp 
600 605 610

GTG GAG ACC AAC CTC TGAGGGCGGG GTCCAAGAAG GCCAATCACA 1993 
Val Glu Thr Asn Leu 
615 

GGCCTCGGGC CAGCAGCCTC CTCTCTGGAT GGTTGGACCT GAGCATATAT 2043 

AGAAGCTTGG CTGATACATG CCCTGCCCAG AAGTCCCTGT GTCTTACCCG 2093 

CACCAAAGAG AGAGAGAGAG AGAGAGAGAG AGAGAGAGAG AGAGAGAGAG 2143 

GAGTTGGTTC TCCATCCACA AAGGAAACCG TCTGGAACCT TCATGCCCTT 2193 

GTAGATTTCA GTAGGCAGCG GAGAACACTC AGCTTCTCCA GACTGAGGTT 2243 

TTCTCATTTA TCAGGCAGAG AAACGGAGGG CTGTCACCCC AACACCGGGG 2293

AGGAGACAGT AGAAGGGTCA TAGATACAAA GAAAACTAAG GCAGAGGGAG 2343

AAATGAATTG TCTACAGAGC ACAGAGCTCC AAGGATTGTG AAGCTACCTT 2393

GAGGTGCCAA GGGACGGATT CTCAGAGCCT TCACAAGACA CAAACGGACG 2443

AGTTGCCTCC TCCAATTCAG ATGGTTTGCA GACTATCAGA GAACATGTTT 2493

CTCCTGTGAT CAGCTACCTA GCCTCTGCCA ACGTGTTCCA GCTTCCAGGA 2543

GGCCACACAG ACCCCACCCC CCATGCTCTC ACCCTTTACC CCTGTGCTTT 2593

TCACACACTA GGCAACTGCT CCACCACAGG ACCTCACACC TAGACCTCCG 2643

TTTTTGACAC AGGGCCTTAA GGTAATCTGG CTGCCATCTG ACTATCTCTC 2693

AGCACGTTCA CGTGTACAAT ATTTCATTCT TTTTCATTGC CAAGTTGTCT 2743

TGTAAGGAGA GACCACAATG TGTCATCCAT GCCCAGCTTT TGTGTCTAAC 2793

AAATAAAATC GCTGAAGGTG TTCAGGTGCA ATGGCCTGTG ACATTA 2839


(3) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: PROTEIN 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: RAT 

(B) INDIVIDUAL ISOLATE: SODIUM/IODIDE SYMPORTER 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 


Met Glu Gly Ala Glu Ala Gly Ala Arg Ala Thr 
1 5 10 

Phe Gly Ala Trp Asp Tyr Gly Val Phe Ala Thr Met Leu Leu 
15 20 25 

Val Ser Thr Gly Ile Gly Leu Trp Val Gly Leu Ala Arg Gly
30 35

Gly Gln Arg Ser Ala Asp Asp Phe Phe Thr Gly Gly Arg Gln
40 45 50

Leu Ala Ala Val Pro Val Gly Leu Ser Leu Ala Ala Ser Phe
55 60 65

Met Ser Ala Val Gln Val Leu Gly Val Pro Ala Glu Ala Ala
70 75 80

Arg Tyr Gly Leu Lys Phe Leu Trp Met Cys Ala Gly Gln Leu
85 90 95

Leu Asn Ser Leu Leu Thr Ala Phe Leu Phe Leu Pro Ile Phe
100 105

Tyr Arg Leu Gly Leu Thr Ser Thr Tyr Gln Tyr Leu Glu Leu
110 115 120

Arg Phe Ser Arg Ala Val Arg Leu Cys Gly Thr Leu Gln Tyr
125 130 135

Leu Val Ala Thr Met Leu Tyr Thr Gly Ile Val Ile Tyr Ala
140 145 150

Pro Ala Leu Ile Leu Asn Gln Val Thr Gly Leu Asp Ile Trp
155 160 165

Ala Ser Leu Leu Ser Thr Gly Ile Ile Cys Thr Leu Tyr Thr
170 175

Thr Val Gly Gly Met Lys Ala Val Val Trp Thr Asp Val Phe
180 185 190

Gln Val Val Val Met Leu Val Gly Phe Trp Val Ile Leu Ala
195 200 205

Arg Gly Val Ile Leu Leu Gly Gly Pro Arg Asn Val Leu Ser
210 215 220

Leu Ala Gln Asn His Ser Arg Ile Asn Leu Met Asp Phe Asp
225 230 235

Pro Asp Pro Arg Ser Arg Tyr Thr Phe Trp Thr Phe Ile Val
240 245

Gly Gly Thr Leu Val Trp Leu Ser Met Tyr Gly Val Asn Gln
250 255 260

Ala Gln Val Gln Arg Tyr Val Ala Cys His Thr Glu Gly Lys
265 270 275


Ala Lys Leu Ala Leu Leu Val Asn Gln Leu Gly Leu Phe Leu
280 285 290

Ile Val Ala Ser Ala Ala Cys Cys Gly Ile Val Met Phe Val
295 300 305

Tyr Tyr Lys Asp Cys Asp Pro Leu Leu Thr Gly Arg Ile Ser
310 315

Ala Pro Asp Gln Tyr Met Pro Leu Leu Val Leu Asp Ile Phe
320 325 330

Glu Asp Leu Pro Gly Val Pro Gly Leu Phe Leu Ala Cys Ala
335 340 345

Tyr Ser Gly Thr Leu Ser Thr Ala Ser Thr Ser Ile Asn Ala
350 355 360

Met Ala Ala Val Thr Val Glu Asp Leu Ile Lys Pro Arg Met
365 370 375

Pro Gly Leu Ala Pro Arg Lys Leu Val Phe Ile Ser Lys Gly
380 385

Leu Ser Phe Ile Tyr Gly Ser Ala Cys Leu Thr Val Ala Ala
390 395 400

Leu Ser Ser Leu Leu Gly Gly Gly Val Leu Gln Gly Ser Phe
405 410 415

Thr Val Met Gly Val Ile Ser Gly Pro Leu Leu Gly Ala Phe
420 425 430

Thr Leu Gly Met Leu Leu Pro Ala Cys Asn Thr Pro Gly Val
435 440 445

Leu Ser Gly Leu Ala Ala Gly Leu Ala Val Ser Leu Trp Val
450 455

Ala Val Gly Ala Thr Leu Tyr Pro Pro Gly Glu Gln Thr Met
460 465 470

Gly Val Leu Pro Thr Ser Ala Ala Gly Cys Thr Asn Asp Ser
475 480 485

Val Leu Leu Gly Pro Pro Gly Ala Thr Asn Ala Ser Asn Gly
490 495 500

Ile Pro Ser Ser Gly Met Asp Thr Gly Arg Pro Ala Leu Ala
505 510 515

Asp Thr Phe Tyr Ala Ile Ser Tyr Leu Tyr Tyr Gly Ala Leu
520 525

Gly Thr Leu Thr Thr Met Leu Cys Gly Ala Leu Ile Ser Tyr
530 535 540

Leu Thr Gly Pro Thr Lys Arg Ser Ser Leu Gly Pro Gly Leu
545 550 555

Leu Trp Trp Asp Leu Ala Arg Gln Thr Ala Ser Val Ala Pro
560 565 570

Lys Glu Asp Thr Ala Thr Leu Glu Glu Ser Leu Val Lys Gly
575 580 585

Pro Glu Asp Ile Pro Ala Val Thr Lys Lys Pro Pro Gly Leu
590 595

Lys Pro Gly Ala Glu Thr His Pro Leu Tyr Leu Gly His Asp
600 605 610

Val Glu Thr Asn Leu
615
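
The relationship between the two sequences can be checked with a short script. What follows is only an illustrative sketch and not part of the listing as filed. It assumes the standard genetic code, and the names `CODON_TABLE`, `translate`, `orf_head` and `orf_tail` are hypothetical helpers introduced here. It translates the first eleven and the last six codons of the open reading frame exactly as printed in SEQ ID NO: 1, and derives the end of the coding region from the printed nucleotide numbering.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Nucleotides 110-142 of SEQ ID NO: 1 (first eleven codons as printed).
orf_head = "ATGGAGGGTGCGGAGGCCGGGGCCCGGGCCACC"
# Last five codons plus the TGA that follows them in the listing.
orf_tail = "GTGGAGACCAACCTCTGA"

print(translate(orf_head))  # MEGAEAGARAT
print(translate(orf_tail))  # VETNL

# The ATG begins at nucleotide 110; 618 codons therefore end at nucleotide 1963.
orf_first, n_residues = 110, 618
orf_last = orf_first + 3 * n_residues - 1
print(orf_last)  # 1963
```

Under these assumptions the coding region spans nucleotides 110 to 1963 of SEQ ID NO: 1, followed by the stop codon TGA at 1964 to 1966, which is consistent with the 618-residue length stated for SEQ ID NO: 2.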


